Multifont Ottoman Character Recognition
نویسندگان
چکیده
Ottoman characters from three different fonts are used character recognition problem, broadly speaking, is transferring a page that contain symbols to the computer and matching these symbols with previously known or recognized symbols after extraction the features of these symbols via appropriate preprocessing methods. Because of silent features of the characters, implementing an Ottoman character recognition system is a difficult work. Different researchers have done lots of works for years to develop systems that would recognize Latin characters. Although almost one million people use Ottoman characters, great deal of whom has different native languages, the number of studies on this field is insufficient. In this study 28 different machine-printed to train the Artificial Neural Network and a %95 classification accuracy for the characters in these fonts and a %70 classification accuracy for a different font has been found.
منابع مشابه
Multifont Classification using Typographical Attributes
This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders and serifs obtained from a word image. The attributes are used as an input to a neural network classifier to produce the multifont classification results. It can classify 7 commonly used fonts for all point sizes from 7...
متن کاملWord-level recognition of multifont Arabic text using a feature vector matching approach
Many text recognition systems recognize text imagery at the character level and assemble words from the recognized characters. An alternative approach is to recognize text imagery at the word level, without analyzing individual characters. This approach avoids the problem of individual character segmentation, and can overcome local errors in character recognition. A word-level recognition syste...
متن کاملWord Recognition With Multi-Level Contextual Knowledge
A word recognition algorithm is proposed that integrates character recognition with word shape analysis. The algorithm consists of a set of serial filters and parallel classifiers, and the decisions are combined to generate a consensus ranking of the input lexicon. Experimental results with multifont machine-printed word images are discussed.
متن کاملAn OCR System for Printed Documents
This paper describes the general structure of a full automated document analysis system for printed documents. The system is based on a character preclassification stage which reduces the number of patterns to recognize and introduces a new contextual processing. This specific approach for multifont printed documents reading is based on pattern character redundancies. With the study of prototyp...
متن کاملntegrated segmentation and recognition of onnected Ottoman script
smet Zeki Yalniz smail Sengor Altingovde ğur Güdükbay zgür Ulusoy ilkent University epartment of Computer Engineering ilkent, Ankara, 06800 urkey -mail: [email protected] Abstract. We propose a novel context-sensitive segmentation and recognition method for connected letters in Ottoman script. This method first extracts a set of segments from a connected script and determines the candi...
متن کامل